Abstract

An $\omega$-language is a set of infinite words over a finite alphabet $X$. We consider the class
of recursive $\omega$-languages, i.e. the class of $\omega$-languages accepted by Turing machines with
a Büchi acceptance condition, which is also the class $\Sigma^1_1$ of (effective) analytic subsets of
$X^\omega$ for some finite alphabet $X$. We investigate the notion of ambiguity for recursive
$\omega$-languages with regard to acceptance by Büchi Turing machines. We first show that the class
of unambiguous recursive $\omega$-languages is the class $\Delta^1_1$ of hyperarithmetical sets. We also
obtain that the $\Delta^1_1$-subsets of $X^\omega$ are the subsets of $X^\omega$ which are accepted by strictly
recursive unambiguous finitely branching Büchi transition systems; this provides an effective analogue
to a theorem of Arnold on Büchi transition systems [Arn83]. Moreover, using some effective
descriptive set theory, we prove that recursive $\omega$-languages satisfy the following dichotomy
property. A recursive $\omega$-language $L \subseteq X^\omega$ is either unambiguous or has a great degree of
ambiguity in the following sense: for every Büchi Turing machine $T$ accepting $L$, there exist
infinitely many $\omega$-words which have $2^{\aleph_0}$ accepting runs by $T$. We also show that if
$L \subseteq X^\omega$ is accepted by a Büchi Turing machine $T$ and $L$ is an analytic but non Borel set, then
the set of $\omega$-words which have $2^{\aleph_0}$ accepting runs by $T$ has cardinality $2^{\aleph_0}$. In that
case we say that the recursive $\omega$-language $L$ has the maximum degree of ambiguity. We prove that it is
$\Pi^1_2$-complete to determine whether a given recursive $\omega$-language is unambiguous and that it is
$\Sigma^1_2$-complete to determine whether a given recursive $\omega$-language has the maximum degree of
ambiguity. Moreover, using some results of set theory, we prove that it is consistent with the
axiomatic system ZFC that there exists a recursive $\omega$-language in the Borel class
$\boldsymbol{\Pi}^0_2$, hence of low Borel rank, which also has this maximum degree of ambiguity.

1998 ACM Subject Classification: F.1.1 Models of Computation; F.4.1 Mathematical Logic.

Keywords: Automata and formal languages; infinite words; Turing machines; Büchi transition systems;
ambiguity; degrees of ambiguity; logic in computer science; effective descriptive set theory; models of set
theory.

1 Introduction 

Languages of infinite words, also called $\omega$-languages, accepted by finite automata were first
studied by Büchi to prove the decidability of the monadic second order theory of one successor
over the integers. Since then regular $\omega$-languages have been much studied and many
applications have been found for specification and verification of non-terminating systems, see
[Tho90, Sta97, PP04] for many results and references. Other finite machines, like pushdown
automata, multicounter automata, Petri nets, have also been considered for the reading of infinite
words, see [Sta97, EH93, Fin06].

Turing invented in 1937 what we now call Turing machines. This way he made a unique impact
on the history of computing, computer science, and the mathematical theory of computability.
Recall that the year 2012 is the Centenary of Alan Turing's birth and that many scientific events
this year will commemorate Turing's life and work.

Cohen and Gold studied in [CG78] the acceptance of infinite words by Turing machines via
several acceptance conditions, like the Büchi or Muller ones. It turned out that the classes of
$\omega$-languages accepted by Büchi or Muller acceptance conditions were the same class, the class of
effective analytic sets [CG78, Sta99].

We consider in this paper the class of recursive $\omega$-languages, i.e. the class of $\omega$-languages
accepted by Turing machines with a Büchi acceptance condition, which is also the class $\Sigma^1_1$ of
(effective) analytic subsets of $X^\omega$ for some finite alphabet $X$.

The notion of ambiguity is very important in formal language and automata theory and has
been much studied, for instance in the case of context-free finitary languages accepted by pushdown
automata or generated by context-free grammars [ABB97], and in the case of context-free
$\omega$-languages [Fin03, FS03]. In the case of Turing machines reading finite words, it is easy to see that
every Turing machine is equivalent to a deterministic, hence also unambiguous, Turing machine.
Thus every recursive finitary language is accepted by an unambiguous Turing machine.

We investigate here the notion of ambiguity for recursive $\omega$-languages with regard to
acceptance by Büchi Turing machines, and we make use of results and methods of effective descriptive
set theory to obtain our new results.

Notice that this study may first seem to be of no practical interest, but in fact non-deterministic 
Turing machines over infinite data seem to be relevant to real-life algorithmics over streams, where 
non-determinism may appear either by choice or because of physical constraints and perturbation. 

We first show that the class of unambiguous recursive $\omega$-languages is the class $\Delta^1_1$ of
hyperarithmetical sets. On the other hand, Arnold studied Büchi transition systems in [Arn83]. In
particular, he proved that the analytic subsets of $X^\omega$ are the subsets of $X^\omega$ which are accepted by
finitely branching Büchi transition systems, and that the Borel subsets of $X^\omega$ are the subsets of $X^\omega$
which are accepted by unambiguous finitely branching Büchi transition systems. Some effective
versions of Büchi transition systems were studied by Staiger in [Sta93]. In particular, he proved
that the subsets of $X^\omega$ which are accepted by strictly recursive finitely branching Büchi transition
systems are the effective analytic subsets of $X^\omega$. We also obtain here that the $\Delta^1_1$-subsets of
$X^\omega$ are the subsets of $X^\omega$ which are accepted by strictly recursive unambiguous finitely branching
Büchi transition systems. This provides an effective analogue to the above cited result of Arnold.

Next, we prove that recursive $\omega$-languages satisfy the following dichotomy property. A
recursive $\omega$-language $L \subseteq X^\omega$ is either unambiguous or has a great degree of ambiguity: for every
Büchi Turing machine $T$ accepting $L$, there exist infinitely many $\omega$-words which have
$2^{\aleph_0}$ accepting runs by $T$.

We also show that if $L \subseteq X^\omega$ is accepted by a Büchi Turing machine $T$ and $L$ is an analytic
but non Borel set, then the set of $\omega$-words which have $2^{\aleph_0}$ accepting runs by $T$ has
cardinality $2^{\aleph_0}$. This extends a similar result of [FS03] in the case of context-free
$\omega$-languages and infinitary rational relations. In that case we say that the recursive
$\omega$-language $L$ has the maximum degree of ambiguity.

Castro and Cucker studied decision problems for $\omega$-languages of Turing machines in [CC89].
They gave the (high) degrees of many classical decision problems, like the emptiness, finiteness,
cofiniteness, universality, equality and inclusion problems. In [Fin09b] we obtained
many new high undecidability results about context-free $\omega$-languages and infinitary rational
relations. We prove here new high undecidability results about ambiguity of recursive $\omega$-languages:
it is $\Pi^1_2$-complete to determine whether a given recursive $\omega$-language is unambiguous and it is
$\Sigma^1_2$-complete to determine whether a given recursive $\omega$-language has the maximum degree of
ambiguity.

Then, using some recent results from [Fin09a] and some results of set theory, we prove that
it is consistent with the axiomatic system ZFC that there exists a recursive $\omega$-language in the
Borel class $\boldsymbol{\Pi}^0_2$, hence of low Borel rank, which also has the maximum degree of ambiguity.

The paper is organized as follows. We recall some known notions in Section 2. We study
unambiguous recursive $\omega$-languages in Section 3 and inherently ambiguous recursive $\omega$-languages
in Section 4. Some concluding remarks are given in Section 5.

2 Reminder of some well-known notions 

We assume the reader to be familiar with the theory of formal ($\omega$-)languages [Sta97, PP04].
We recall the usual notations of formal language theory.

If $\Sigma$ is a finite alphabet, a non-empty finite word over $\Sigma$ is any sequence $x = a_1 \ldots a_k$, where
$a_i \in \Sigma$ for $i = 1, \ldots, k$, and $k$ is an integer $\geq 1$. The length of $x$ is $k$, denoted by $|x|$. The empty
word has no letter and is denoted by $\lambda$; its length is 0. $\Sigma^*$ is the set of finite words (including the
empty word) over $\Sigma$. A (finitary) language $V$ over an alphabet $\Sigma$ is a subset of $\Sigma^*$.

The first infinite ordinal is $\omega$. An $\omega$-word over $\Sigma$ is an $\omega$-sequence $a_1 \ldots a_n \ldots$, where for all
integers $i \geq 1$, $a_i \in \Sigma$. When $\sigma = a_1 \ldots a_n \ldots$ is an $\omega$-word over $\Sigma$, we write $\sigma(n) = a_n$,
$\sigma[n] = \sigma(1)\sigma(2) \ldots \sigma(n)$ for all $n \geq 1$ and $\sigma[0] = \lambda$.

The concatenation of two finite words $u$ and $v$ is denoted $u \cdot v$ (and sometimes just $uv$). This
operation is extended to the product of a finite word $u$ and an $\omega$-word $v$: the infinite word $u \cdot v$ is
then the $\omega$-word such that:

$(u \cdot v)(k) = u(k)$ if $k \leq |u|$, and $(u \cdot v)(k) = v(k - |u|)$ if $k > |u|$.
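
The notations above can be illustrated by a small Python sketch. It is not part of the original development: it assumes that an ω-word is represented as a function from positions 1, 2, 3, ... to one-character letters, and the names `prefix`, `concat` and `periodic` are hypothetical helpers introduced only for this illustration.

```python
from typing import Callable

# Assumption: an omega-word is modelled as a function from positions
# 1, 2, 3, ... to letters (one-character strings).
OmegaWord = Callable[[int], str]


def prefix(sigma: OmegaWord, n: int) -> str:
    """Return sigma[n] = sigma(1) sigma(2) ... sigma(n); sigma[0] is the empty word."""
    return "".join(sigma(i) for i in range(1, n + 1))


def concat(u: str, v: OmegaWord) -> OmegaWord:
    """Return the omega-word u . v, following the two-case definition."""
    def uv(k: int) -> str:
        if k <= len(u):
            return u[k - 1]  # Python strings are 0-indexed, words here are 1-indexed
        return v(k - len(u))
    return uv


def periodic(block: str) -> OmegaWord:
    """Return the omega-word block . block . block ... for a non-empty block."""
    if not block:
        raise ValueError("block must be non-empty")
    return lambda k: block[(k - 1) % len(block)]
```

For instance, `prefix(concat("ab", periodic("c")), 5)` evaluates to `"abccc"`. Only finite prefixes can ever be inspected this way, which is the reason why the functional representation was chosen for the sketch.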

The set of $\omega$-words over an alphabet $\Sigma$ is denoted by $\Sigma^\omega$. An $\omega$-language $V$ over an alphabet
$\Sigma$ is a subset of $\Sigma^\omega$, and its complement (in $\Sigma^\omega$) is $\Sigma^\omega - V$, denoted $V^-$.

We assume the reader to be familiar with basic notions of topology, which may be found in
[Kec95, LT94, Sta97, PP04]. There is a natural metric on the set $\Sigma^\omega$ of infinite words over a
finite alphabet $\Sigma$ containing at least two letters, which is called the prefix metric, and is defined as
follows. For $u, v \in \Sigma^\omega$ and $u \neq v$ let $\delta(u, v) = 2^{-l_{\mathrm{pref}(u,v)}}$ where $l_{\mathrm{pref}(u,v)}$ is the first integer $n$
such that the $(n + 1)$st letter of $u$ is different from the $(n + 1)$st letter of $v$. This metric induces
on $\Sigma^\omega$ the usual Cantor topology in which the open subsets of $\Sigma^\omega$ are of the form $W \cdot \Sigma^\omega$, for
$W \subseteq \Sigma^*$. A set $L \subseteq \Sigma^\omega$ is a closed set iff its complement $\Sigma^\omega - L$ is an open set.
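
As a hedged illustration of the prefix metric, the following Python sketch computes the distance between two ω-words given as functions from positions 1, 2, 3, ... to letters (a representation assumed here, not one used in the paper). Since equality of two arbitrary ω-words cannot be decided by inspecting finitely many letters, the hypothetical parameter `bound` limits the search, and `None` is returned when no difference is found within that bound.

```python
from fractions import Fraction
from typing import Callable, Optional

# Assumption: an omega-word is a function from positions 1, 2, 3, ... to letters.
OmegaWord = Callable[[int], str]


def first_difference(u: OmegaWord, v: OmegaWord, bound: int) -> Optional[int]:
    """Return the least n < bound such that letter n + 1 of u and v differ, else None."""
    for n in range(bound):
        if u(n + 1) != v(n + 1):
            return n
    return None


def prefix_distance(u: OmegaWord, v: OmegaWord, bound: int) -> Optional[Fraction]:
    """Return 2^(-n) for the n found by first_difference, or None if none is found."""
    n = first_difference(u, v, bound)
    if n is None:
        return None  # u and v agree on their first `bound` letters
    return Fraction(1, 2 ** n)
```

Two words that differ on their first letter are at distance 1, and two words sharing a prefix of length $n$ are at distance at most $2^{-n}$, so an open ball corresponds to the set of ω-words extending a fixed finite prefix.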

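We now recall the definition of the Borel Hierarchy of subsets of $X^\omega$.

Definition 2.1 For a non-null countable ordinal $\alpha$, the classes $\boldsymbol{\Sigma}^0_\alpha$ and $\boldsymbol{\Pi}^0_\alpha$ of the Borel Hierarchy
on the topological space $X^\omega$ are defined as follows: $\boldsymbol{\Sigma}^0_1$ is the class of open subsets of $X^\omega$, $\boldsymbol{\Pi}^0_1$ is
the class of closed subsets of $X^\omega$, and for any countable ordinal $\alpha \geq 2$:

$\boldsymbol{\Sigma}^0_\alpha$ is the class of countable unions of subsets of $X^\omega$ in $\bigcup_{\gamma < \alpha} \boldsymbol{\Pi}^0_\gamma$.

$\boldsymbol{\Pi}^0_\alpha$ is the class of countable intersections of subsets of $X^\omega$ in $\bigcup_{\gamma < \alpha} \boldsymbol{\Sigma}^0_\gamma$.

A set $L \subseteq X^\omega$ is Borel iff it is in the union $\bigcup_{\alpha < \omega_1} \boldsymbol{\Sigma}^0_\alpha = \bigcup_{\alpha < \omega_1} \boldsymbol{\Pi}^0_\alpha$, where $\omega_1$ is the first
uncountable ordinal.

For a countable ordinal $\alpha$, a set $L \subseteq X^\omega$ is a Borel set of rank $\alpha$ iff it is in $\boldsymbol{\Sigma}^0_\alpha \cup \boldsymbol{\Pi}^0_\alpha$ but not
in $\bigcup_{\gamma < \alpha} (\boldsymbol{\Sigma}^0_\gamma \cup \boldsymbol{\Pi}^0_\gamma)$.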

There are also some subsets of $X^\omega$ which are not Borel. In particular the class of Borel subsets of
$X^\omega$ is strictly included into the class $\boldsymbol{\Sigma}^1_1$ of analytic sets which are obtained by projection of Borel
sets. The co-analytic sets are the complements of analytic sets.



Definition 2.2 A subset $A$ of $X^\omega$ is in the class $\boldsymbol{\Sigma}^1_1$ of analytic sets iff there exist a finite alphabet
$Y$ and a Borel subset $B$ of $(X \times Y)^\omega$ such that $x \in A \Leftrightarrow \exists y \in Y^\omega$ such that $(x, y) \in B$, where
$(x, y)$ is the infinite word over the alphabet $X \times Y$ such that $(x, y)(i) = (x(i), y(i))$ for each
integer $i \geq 1$.

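We now define completeness with regard to reduction by continuous functions. For a countable
ordinal $\alpha \geq 1$, a set $F \subseteq X^\omega$ is said to be a $\boldsymbol{\Sigma}^0_\alpha$ (respectively, $\boldsymbol{\Pi}^0_\alpha$, $\boldsymbol{\Sigma}^1_1$)-complete set iff for any
set $E \subseteq Y^\omega$ (with $Y$ a finite alphabet): $E \in \boldsymbol{\Sigma}^0_\alpha$ (respectively, $E \in \boldsymbol{\Pi}^0_\alpha$, $E \in \boldsymbol{\Sigma}^1_1$) iff there exists
a continuous function $f : Y^\omega \to X^\omega$ such that $E = f^{-1}(F)$, i.e. such that ($x \in E$ iff $f(x) \in F$).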

We now recall the definition of the arithmetical hierarchy of $\omega$-languages which form the
effective analogue to the hierarchy of Borel sets of finite ranks, see [Sta97].

Let $X$ be a finite alphabet. An $\omega$-language $L \subseteq X^\omega$ belongs to the class $\Sigma_n$ if and only if
there exists a recursive relation $R_L \subseteq (\mathbb{N})^{n-1} \times X^*$ such that

$L = \{\sigma \in X^\omega \mid \exists k_1 \ldots Q_n k_n \ (k_1, \ldots, k_{n-1}, \sigma[k_n + 1]) \in R_L\}$

where $Q_i$ is one of the quantifiers $\forall$ or $\exists$ (not necessarily in an alternating order). An $\omega$-language
$L \subseteq X^\omega$ belongs to the class $\Pi_n$ if and only if its complement $X^\omega - L$ belongs to the class $\Sigma_n$. The
inclusion relations that hold between the classes $\Sigma_n$ and $\Pi_n$ are the same as for the corresponding
classes of the Borel hierarchy. The classes $\Sigma_n$ and $\Pi_n$ are included in the respective classes $\boldsymbol{\Sigma}^0_n$
and $\boldsymbol{\Pi}^0_n$ of the Borel hierarchy, and cardinality arguments suffice to show that these inclusions are
strict.
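
As an illustration that is not taken from the original text, the definition can be instantiated on the alphabet $X = \{a, b\}$, with $k_1, k_2$ ranging over $\mathbb{N}$. The relations $R_1$ and $R_2$ below are hypothetical choices made only for this example; both are clearly recursive.

```latex
% Sigma_1: omega-words containing at least one letter a.
L_1 = \{\sigma \in X^\omega \mid \exists k_1 \ \sigma[k_1 + 1] \in R_1\},
\qquad R_1 = \{u \in X^* \mid u \text{ ends with } a\}.

% Sigma_2: omega-words containing only finitely many letters a.
L_2 = \{\sigma \in X^\omega \mid \exists k_1 \forall k_2 \ (k_1, \sigma[k_2 + 1]) \in R_2\},
\qquad R_2 = \{(k, u) \in \mathbb{N} \times X^* \mid |u| \leq k \text{ or } u \text{ ends with } b\}.
```

The complement of $L_2$, the set of $\omega$-words containing infinitely many letters $a$, is then in the class $\Pi_2$ by definition.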

As in the case of the Borel hierarchy, projections of arithmetical sets lead beyond the
arithmetical hierarchy, to the analytical hierarchy of $\omega$-languages. The first class of this hierarchy is the
(lightface) class $\Sigma^1_1$ of effective analytic sets which are obtained by projection of arithmetical sets.
In fact an $\omega$-language $L \subseteq X^\omega$ is in the class $\Sigma^1_1$ iff it is the projection of an $\omega$-language over the
alphabet $X \times \{0, 1\}$ which is in the class $\Pi_2$. The (lightface) class $\Pi^1_1$ of effective co-analytic sets
is simply the class of complements of effective analytic sets. We denote as usual $\Delta^1_1 = \Sigma^1_1 \cap \Pi^1_1$.

The (lightface) class $\Sigma^1_1$ of effective analytic sets is strictly included into the (boldface) class
$\boldsymbol{\Sigma}^1_1$ of analytic sets.

We assume the reader to be familiar with the arithmetical and analytical hierarchies on
subsets of $\mathbb{N}$; these notions may be found in the textbooks on computability theory [Rog67], [Odi89,
Odi99].

We shall consider in the sequel some $\Sigma^1_1$ or $\Pi^1_1$ subsets of product spaces like $X^\omega \times Y^\omega$ or
$\mathbb{N} \times Y^\omega$. Moreover, in effective descriptive set theory one often considers the notion of relativized
class $\Sigma^1_1(w)$: for $w \in X^\omega$, a set $L \subseteq Y^\omega$ is a $\Sigma^1_1(w)$-set iff there exists a $\Sigma^1_1$-set $T \subseteq X^\omega \times Y^\omega$
such that $L = \{y \in Y^\omega \mid (w, y) \in T\}$. A set $L \subseteq Y^\omega$ is a $\Pi^1_1(w)$-set iff its complement is a
$\Sigma^1_1(w)$-set. A set $L \subseteq Y^\omega$ is a $\Delta^1_1(w)$-set iff it is in the class $\Sigma^1_1(w) \cap \Pi^1_1(w)$. We say that $y \in Y^\omega$
is in the class $\Delta^1_1$ (respectively, $\Delta^1_1(w)$) iff the singleton $\{y\}$ is a $\Delta^1_1$-set (respectively, $\Delta^1_1(w)$-set).

Recall now the notion of acceptance of infinite words by Turing machines considered by Cohen
and Gold in [CG78].

Definition 2.3 A non deterministic Turing machine $\mathcal{M}$ is a 5-tuple $\mathcal{M} = (Q, \Sigma, \Gamma, \delta, q_0)$, where
$Q$ is a finite set of states, $\Sigma$ is a finite input alphabet, $\Gamma$ is a finite tape alphabet satisfying $\Sigma \subseteq \Gamma$, $q_0$
is the initial state, and $\delta$ is a mapping from $Q \times \Gamma$ to subsets of $Q \times \Gamma \times \{L, R, S\}$. A configuration
of $\mathcal{M}$ is a triple $(q, \alpha, i)$, where $q \in Q$, $\alpha \in \Gamma^\omega$ and $i \in \mathbb{N}$. An infinite sequence of configurations
$r = (q_i, \alpha_i, j_i)_{i \geq 1}$ is called a run of $\mathcal{M}$ on $w \in \Sigma^\omega$ iff:

(a) $(q_1, \alpha_1, j_1) = (q_0, w, 1)$, and

(b) for each $i \geq 1$, $(q_i, \alpha_i, j_i) \vdash (q_{i+1}, \alpha_{i+1}, j_{i+1})$,

where $\vdash$ is the transition relation of $\mathcal{M}$ defined as usual. The run $r$ is said to be complete if every
position is visited, i.e. if $(\forall n \geq 1)(\exists k \geq 1)(j_k \geq n)$. The run $r$ is said to be oscillating if some
position is visited infinitely often, i.e. if $(\exists k \geq 1)(\forall n \geq 1)(\exists m \geq n)(j_m = k)$.

Definition 2.4 Let $\mathcal{M} = (Q, \Sigma, \Gamma, \delta, q_0)$ be a non deterministic Turing machine and $F \subseteq Q$,
$\mathcal{F} \subseteq 2^Q$. The $\omega$-language 1'-accepted (respectively, 2-accepted) by $(\mathcal{M}, F)$ is the set of $\omega$-words
$\sigma \in \Sigma^\omega$ such that there exists a complete non oscillating run $r = (q_i, \alpha_i, j_i)_{i \geq 1}$ of $\mathcal{M}$ on $\sigma$ such
that, for all $i$, $q_i \in F$ (respectively, for infinitely many $i$, $q_i \in F$). The $\omega$-language 3-accepted by
$(\mathcal{M}, \mathcal{F})$ is the set of $\omega$-words $\sigma \in \Sigma^\omega$ such that there exists a complete non oscillating run $r$ of $\mathcal{M}$
on $\sigma$ such that the set of states appearing infinitely often during the run $r$ is an element of $\mathcal{F}$.

The 1'-acceptance condition is also considered by Castro and Cucker in [CC89]. The 2-acceptance
and 3-acceptance conditions are now usually called Büchi and Muller acceptance conditions.
Cohen and Gold proved the following result in [CG78, Theorem 8.2].

Theorem 2.5 (Cohen and Gold [CG78]) An $\omega$-language is accepted by a non deterministic Turing
machine with 1'-acceptance condition iff it is accepted by a non deterministic Turing machine
with Büchi (or Muller) acceptance condition.

Notice that this result holds because Cohen and Gold's Turing machines accept infinite words
via complete non oscillating runs, while 1', Büchi or Muller acceptance conditions refer to the
sequence of states entered during an infinite run. For other approaches, acceptance is based only
on the sequence of states entered by the machine during an infinite computation [Sta97], or one
requires also that the machine reads the whole infinite tape [EH93]. See [SW78, Sta99, FS00,
Sta00] for a study of these different approaches.

From now on we shall consider Turing machines accepting $\omega$-words via acceptance by
complete runs (i.e., not necessarily non oscillating). By [Sta99, Theorem 16] (see also [Sta00,
Theorem 5.2]) we have the following characterization of the class of $\omega$-languages accepted by these
non deterministic Turing machines.

Theorem 2.6 ([Sta99]) The class of $\omega$-languages accepted by non deterministic Turing machines
with 1' (respectively, Büchi, Muller) acceptance condition is the class $\Sigma^1_1$ of effective analytic sets.

In the sequel we shall also restrict our study to the Büchi acceptance condition. But all the
results of this paper can easily be seen to be true for any other acceptance condition leading to the
class $\Sigma^1_1$ of effective analytic sets. For instance it follows from [CG78, Note 2 page 12] and from
Theorem 2.6 that the class of $\omega$-languages accepted by Cohen and Gold's non deterministic Turing
machines with 1' (respectively, Büchi, Muller) acceptance condition is the class $\Sigma^1_1$. Moreover
the class $\Sigma^1_1$ is also the class of $\omega$-languages accepted by Turing machines with Büchi acceptance
condition if we do not require that the Turing machine reads the whole infinite tape but only that
it runs forever [Sta97].

Due to the above results, we shall say, as in [Sta97], that an $\omega$-language is recursive iff it
belongs to the class $\Sigma^1_1$. Notice that in another presentation, as in [Rog67], the recursive
$\omega$-languages are those which are in the class $\Sigma_1 \cap \Pi_1$, see also [LT94].

3 Unambiguous recursive $\omega$-languages



In the sequel a Büchi Turing machine will be a Turing machine reading infinite words and
accepting $\omega$-words with a Büchi acceptance condition via complete runs.

Definition 3.1 A Büchi Turing machine $\mathcal{M}$ reading $\omega$-words over an alphabet $\Sigma$ is said to be
unambiguous iff every $\omega$-word $x \in \Sigma^\omega$ has at most one accepting run by $\mathcal{M}$.

Definition 3.2 Let $\Sigma$ be a finite alphabet. A recursive $\omega$-language $L \subseteq \Sigma^\omega$ is said to be
unambiguous iff it is accepted by (at least) one unambiguous Büchi Turing machine. Otherwise the
recursive $\omega$-language $L$ is said to be inherently ambiguous.

We can now state our first result. 

Proposition 3.3 If $\Sigma$ is a finite alphabet and $L \subseteq \Sigma^\omega$ is an unambiguous recursive $\omega$-language
then $L$ belongs to the (effective) class $\Delta^1_1$.

Proof. Let $L \subseteq \Sigma^\omega$ be an $\omega$-language accepted by an unambiguous Büchi Turing machine
$(\mathcal{M}, F)$, where $\mathcal{M} = (Q, \Sigma, \Gamma, \delta, q_0)$ is a Turing machine and $F \subseteq Q$. Recall that a configuration
of the Turing machine $\mathcal{M}$ is a triple $(q, \alpha, i)$, where $q \in Q$, $\alpha \in \Gamma^\omega$ and $i \in \mathbb{N}$. It can be coded by
the infinite word $q^i \cdot \alpha$ over the alphabet $Q \cup \Gamma$, where we have assumed without loss of generality
that $Q$ and $\Gamma$ are disjoint. Then a run of $\mathcal{M}$ on $w \in \Sigma^\omega$ is an infinite sequence of configurations
$r = (q_i, \alpha_i, j_i)_{i \geq 1}$ which is then coded by an infinite sequence of $\omega$-words $(r_i)_{i \geq 1} = (q_i^{j_i} \cdot \alpha_i)_{i \geq 1}$
over $Q \cup \Gamma$. Using now a recursive bijection $b : (\mathbb{N} \setminus \{0\})^2 \to \mathbb{N} \setminus \{0\}$ and its inverse $b^{-1}$ we
can effectively code the sequence $(r_i)_{i \geq 1}$ by a single infinite word $r' \in (Q \cup \Gamma)^\omega$ defined by:
for every integer $j \geq 1$ such that $b^{-1}(j) = (i_1, i_2)$, $r'(j) = r_{i_1}(i_2)$. Moreover the infinite word
$r' \in (Q \cup \Gamma)^\omega$ can be coded in a recursive manner by an infinite word over the alphabet $\{0, 1\}$.
We can then identify $r$ with its code $\bar{r} \in \{0, 1\}^\omega$ and this will often be done in the sequel. Let now
$R$ be defined by:

$R = \{(w, r) \mid w \in \Sigma^\omega$ and $r \in \{0, 1\}^\omega$ is an accepting run of $(\mathcal{M}, F)$ on the $\omega$-word $w\}$.

The set $R$ is a $\Delta^1_1$-set, and even an arithmetical set: it is easy to see that it is accepted by a
deterministic Muller Turing machine and thus it is a $\Delta_3$-subset of the space $(\Sigma \times \{0, 1\})^\omega$, see
[Sta97].

Consider now the projection $\mathrm{PROJ}_{\Sigma^\omega} : \Sigma^\omega \times \{0, 1\}^\omega \to \Sigma^\omega$ defined by $\mathrm{PROJ}_{\Sigma^\omega}(w, r) = w$
for all $(w, r) \in \Sigma^\omega \times \{0, 1\}^\omega$. This projection is a recursive function, i.e. "there is an algorithm
which given sufficiently close approximations to $(w, r)$ produces arbitrarily accurate
approximations to $\mathrm{PROJ}_{\Sigma^\omega}(w, r)$", see [Mos09]. Moreover it is injective on the $\Delta^1_1$-set $R$ because the Büchi
Turing machine $(\mathcal{M}, F)$ is unambiguous. But the image of a $\Delta^1_1$-set by an injective recursive
function is a $\Delta^1_1$-set, see [Mos09, page 169], and thus the recursive $\omega$-language $L = \mathrm{PROJ}_{\Sigma^\omega}(R)$ is a
$\Delta^1_1$-subset of $\Sigma^\omega$. □
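
The coding of a run by a single infinite word, used in the proof of Proposition 3.3, can be sketched in Python. The proof only requires some recursive bijection $b$ between pairs of positive integers and positive integers; the shifted Cantor pairing below is one possible choice, made here as an assumption, and the names `pair`, `unpair` and `interleave` are hypothetical.

```python
from math import isqrt
from typing import Callable, Tuple


def pair(i1: int, i2: int) -> int:
    """One possible recursive bijection b from pairs of positive integers to positive integers."""
    x, y = i1 - 1, i2 - 1
    return (x + y) * (x + y + 1) // 2 + y + 1


def unpair(j: int) -> Tuple[int, int]:
    """Inverse of pair: return (i1, i2) such that pair(i1, i2) == j."""
    z = j - 1
    w = (isqrt(8 * z + 1) - 1) // 2  # index of the diagonal containing z
    y = z - w * (w + 1) // 2
    x = w - y
    return x + 1, y + 1


def interleave(rows: Callable[[int], Callable[[int], object]]) -> Callable[[int], object]:
    """Code the sequence of omega-words rows(1), rows(2), ... by one omega-word r'.

    The coding follows the rule r'(j) = r_{i1}(i2) where (i1, i2) = unpair(j).
    """
    def r_prime(j: int) -> object:
        i1, i2 = unpair(j)
        return rows(i1)(i2)
    return r_prime
```

Every letter of every configuration word $r_i$ appears at exactly one position of $r'$, and that position is computable from $(i, k)$ and conversely, which is what makes the coding effective.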

In order to prove a converse statement we now first recall the notion of Büchi transition system.

Definition 3.4 A Büchi transition system is a tuple $\mathcal{T} = (\Sigma, Q, \delta, q_0, Q_f)$, where $\Sigma$ is a finite input
alphabet, $Q$ is a countable set of states, $\delta \subseteq Q \times \Sigma \times Q$ is the transition relation, $q_0 \in Q$ is the
initial state, and $Q_f \subseteq Q$ is the set of final states. A run of $\mathcal{T}$ over an infinite word $\sigma \in \Sigma^\omega$ is an
infinite sequence of states $(t_i)_{i \geq 0}$, such that $t_0 = q_0$, and for each $i \geq 0$, $(t_i, \sigma(i + 1), t_{i+1}) \in \delta$.
The run is said to be accepting iff there are infinitely many integers $i$ such that $t_i$ is in $Q_f$. An
$\omega$-word $\sigma \in \Sigma^\omega$ is accepted by $\mathcal{T}$ iff there is (at least) one accepting run of $\mathcal{T}$ over $\sigma$. The
$\omega$-language $L(\mathcal{T})$ accepted by $\mathcal{T}$ is the set of $\omega$-words accepted by $\mathcal{T}$. The transition system is said to
be unambiguous if each infinite word $\sigma \in \Sigma^\omega$ has at most one accepting run by $\mathcal{T}$. The transition
system is said to be finitely branching if for each state $q \in Q$ and each $a \in \Sigma$, there are only
finitely many states $q'$ such that $(q, a, q') \in \delta$.
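
To make Definition 3.4 concrete, here is a minimal Python sketch restricted to a much simpler setting than the one of the definition: it assumes a finite set of states and an ultimately periodic input $u \cdot v \cdot v \cdot v \ldots$, for which the existence of an accepting run can be decided by a graph search. The function names are hypothetical, and the general case (countably many states, arbitrary $\omega$-words) is not covered.

```python
from typing import Hashable, List, Set, Tuple

Transition = Tuple[Hashable, str, Hashable]  # (q, a, q') in delta


def successors(delta: Set[Transition], states: Set[Hashable], letter: str) -> Set[Hashable]:
    """States reachable from `states` by reading one occurrence of `letter`."""
    return {q2 for (q1, a, q2) in delta if q1 in states and a == letter}


def reachable(edges, start) -> set:
    """Nodes reachable from `start` in at least one step of the graph given by `edges`."""
    seen = set()
    stack: List = list(edges(start))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(edges(node))
    return seen


def accepts_lasso(delta: Set[Transition], q0, final: Set[Hashable], u: str, v: str) -> bool:
    """Decide whether some run over u.v.v.v... visits `final` infinitely often."""
    if not v:
        raise ValueError("the periodic part v must be non-empty")
    current = {q0}
    for a in u:  # read the finite prefix u
        current = successors(delta, current, a)

    def edges(node):
        q, p = node  # p is the position in v of the next letter to read
        return [(q2, (p + 1) % len(v)) for q2 in successors(delta, {q}, v[p])]

    candidates = {(q, 0) for q in current}
    for start in list(candidates):
        candidates |= reachable(edges, start)
    # An accepting run exists iff a reachable node with a final state lies on a cycle.
    return any(q in final and (q, p) in reachable(edges, (q, p)) for (q, p) in candidates)
```

With states `0`, `1`, initial state `0`, final states `{1}` and transitions `{(0, "a", 0), (0, "b", 0), (0, "a", 1), (1, "a", 1)}`, the call `accepts_lasso(delta, 0, {1}, "b", "a")` returns `True`. This small system is ambiguous in the sense of the definition, since the run may move to state `1` on any letter `a` of the input.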

Arnold proved the following theorem in [Arn83]. 

Theorem 3.5 Let $\Sigma$ be an alphabet having at least two letters.

1. The analytic subsets of $\Sigma^\omega$ are the subsets of $\Sigma^\omega$ which are accepted by finitely branching
Büchi transition systems.

2. The Borel subsets of $\Sigma^\omega$ are the subsets of $\Sigma^\omega$ which are accepted by unambiguous finitely
branching Büchi transition systems.

It is also very natural to consider effective versions of Büchi transition systems where the sets
$Q$, $\delta$, and $Q_f$ are recursive. Such transition systems are studied by Staiger in [Sta93] where $Q$ is
actually either the set $\mathbb{N}$ of natural numbers or a finite segment of it, and they are called strictly
recursive. It is proved by Staiger that the subsets of $\Sigma^\omega$ which are accepted by strictly recursive
finitely branching Büchi transition systems are the effective analytic subsets of $\Sigma^\omega$.

On the other hand, the Büchi transition systems are considered by Finkel and Lecomte in
[FL09] where they are used in the study of topological properties of $\omega$-powers. Using an effective
version of a theorem of Kuratowski, it is proved in [FL09] that every $\Delta^1_1$-subset of $\{0, 1\}^\omega$ is
actually accepted by an unambiguous strictly recursive finitely branching Büchi transition system
(where the degree of branching of the transition system is actually equal to 2). Using an easy
coding this is easily extended to the case of any $\Delta^1_1$-subset of $\Sigma^\omega$, where $\Sigma$ is a finite alphabet.

Using the fact that every recursive set of finite words over a finite alphabet $\Gamma$ is accepted by a
deterministic hence also unambiguous Turing machine reading finite words, we can easily see that
every $\omega$-language which is accepted by an unambiguous strictly recursive finitely branching Büchi
transition system is also accepted by an unambiguous Büchi Turing machine. This shows that the
converse of Proposition 3.3 holds and thus we have the following characterization of unambiguous
recursive $\omega$-languages.

Theorem 3.6 Let $\Sigma$ be an alphabet having at least two letters. An $\omega$-language $L \subseteq \Sigma^\omega$ is an
unambiguous recursive $\omega$-language iff $L$ belongs to the (effective) class $\Delta^1_1$.

Notice that we have also the effective analogue to Arnold's Theorem 3.5.

Theorem 3.7 Let $\Sigma$ be an alphabet having at least two letters.

1. The effective analytic subsets of $\Sigma^\omega$ are the subsets of $\Sigma^\omega$ which are accepted by strictly
recursive finitely branching Büchi transition systems.

2. The $\Delta^1_1$-subsets of $\Sigma^\omega$ are the subsets of $\Sigma^\omega$ which are accepted by strictly recursive
unambiguous finitely branching Büchi transition systems.

Proof. Item 1 is proved in [Sta93]. To prove that every $\omega$-language which is accepted by a strictly
recursive unambiguous finitely branching Büchi transition system is a $\Delta^1_1$-set we can reason as
in the case of Turing machines (see the proof of Proposition 3.3). As said above, the converse
statement is proved in [FL09]. □

4 Inherently ambiguous recursive $\omega$-languages



The notion of ambiguity for context-free $\omega$-languages has been studied in [Fin03, FS03]. In
particular it was proved in [FS03] that every context-free $\omega$-language which is non-Borel has a maximum
degree of ambiguity. This was proved by first stating a lemma, using a theorem of Lusin and
Novikov. We now recall this lemma and its proof.

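Lemma 4.1 ([FS03]) Let $\Sigma$ and $X$ be two finite alphabets having at least two letters and $B$ be a
Borel subset of $\Sigma^\omega \times X^\omega$ such that $\mathrm{PROJ}_{\Sigma^\omega}(B)$ is not a Borel subset of $\Sigma^\omega$. Then there are $2^{\aleph_0}$
$\omega$-words $\alpha \in \Sigma^\omega$ such that the section $B_\alpha = \{\beta \in X^\omega \mid (\alpha, \beta) \in B\}$ has cardinality $2^{\aleph_0}$.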

Proof. Let $\Sigma$ and $X$ be two finite alphabets having at least two letters and $B$ be a Borel subset of
$\Sigma^\omega \times X^\omega$ such that $\mathrm{PROJ}_{\Sigma^\omega}(B)$ is not Borel.

In a first step we prove that there are uncountably many $\alpha \in \Sigma^\omega$ such that the section $B_\alpha$
is uncountable. Recall that by a theorem of Lusin and Novikov, see [Kec95, page 123], if for
all $\alpha \in \Sigma^\omega$ the section $B_\alpha$ of the Borel set $B$ were countable, then $\mathrm{PROJ}_{\Sigma^\omega}(B)$ would be a
Borel subset of $\Sigma^\omega$. Thus there exists at least one $\alpha \in \Sigma^\omega$ such that $B_\alpha$ is uncountable. In
fact there is more than one $\alpha$ such that $B_\alpha$ is uncountable. For $\alpha \in \Sigma^\omega$ we have $\{\alpha\} \times B_\alpha =
B \cap [\{\alpha\} \times X^\omega]$. But $\{\alpha\} \times X^\omega$ is a closed hence Borel subset of $\Sigma^\omega \times X^\omega$, thus $\{\alpha\} \times B_\alpha$ is
Borel as the intersection of two Borel sets. If there were only one $\alpha \in \Sigma^\omega$ such that $B_\alpha$ is uncountable,
then $C = \{\alpha\} \times B_\alpha$ would be Borel, so $D = B - C$ would be Borel because the class of Borel sets
is closed under Boolean operations. But all sections of $D$ would be countable, thus $\mathrm{PROJ}_{\Sigma^\omega}(D)$
would be Borel by Lusin and Novikov's Theorem. Then $\mathrm{PROJ}_{\Sigma^\omega}(B) = \{\alpha\} \cup \mathrm{PROJ}_{\Sigma^\omega}(D)$
would also be Borel as the union of two Borel sets, and this would lead to a contradiction. In a similar
manner we can prove that the set $U = \{\alpha \in \Sigma^\omega \mid B_\alpha$ is uncountable$\}$ is uncountable; otherwise
$U = \{\alpha_0, \alpha_1, \ldots, \alpha_n, \ldots\}$ would be Borel as the countable union of the closed sets $\{\alpha_i\}$, $i \geq 0$.
For each $n \geq 0$ the set $\{\alpha_n\} \times B_{\alpha_n}$ would be Borel, and $C = \bigcup_{n \in \omega} \{\alpha_n\} \times B_{\alpha_n}$ would be
Borel as a countable union of Borel sets. So $D = B - C$ would be Borel too. But all sections
of $D$ would be countable, thus $\mathrm{PROJ}_{\Sigma^\omega}(D)$ would be Borel by Lusin and Novikov's Theorem.
Then $\mathrm{PROJ}_{\Sigma^\omega}(B) = U \cup \mathrm{PROJ}_{\Sigma^\omega}(D)$ would also be Borel as the union of two Borel sets, and this
would lead to a contradiction. So we have proved that the set $\{\alpha \in \Sigma^\omega \mid B_\alpha$ is uncountable$\}$ is
uncountable.

On the other hand we know from another theorem of Descriptive Set Theory that the set
$\{\alpha \in \Sigma^\omega \mid B_\alpha$ is countable$\}$ is a $\boldsymbol{\Pi}^1_1$-subset of $\Sigma^\omega$, see [Kec95, page 123]. Thus its
complement $\{\alpha \in \Sigma^\omega \mid B_\alpha$ is uncountable$\}$ is analytic. But by Suslin's Theorem an analytic subset
of $\Sigma^\omega$ is either countable or has cardinality $2^{\aleph_0}$, [Kec95, p. 88]. Therefore the set $\{\alpha \in \Sigma^\omega \mid
B_\alpha$ is uncountable$\}$ has cardinality $2^{\aleph_0}$. Recall now that we have already seen that, for each
$\alpha \in \Sigma^\omega$, the set $\{\alpha\} \times B_\alpha$ is Borel. Thus $B_\alpha$ itself is Borel and by Suslin's Theorem $B_\alpha$ is either
countable or has cardinality $2^{\aleph_0}$. From this we deduce that $\{\alpha \in \Sigma^\omega \mid B_\alpha$ is uncountable$\}$ =
$\{\alpha \in \Sigma^\omega \mid B_\alpha$ has cardinality $2^{\aleph_0}\}$ has cardinality $2^{\aleph_0}$. □

We can now apply this lemma to the study of ambiguity of Turing machines, in a similar way
as in [FS03] for context-free $\omega$-languages, and state the following result.

Theorem 4.2 Let $L \subseteq \Sigma^\omega$ be an $\omega$-language accepted by a Büchi Turing machine $(\mathcal{M}, F)$ such
that $L$ is an analytic but non Borel set. The set of $\omega$-words which have $2^{\aleph_0}$ accepting runs by
$(\mathcal{M}, F)$ has cardinality $2^{\aleph_0}$.

Proof. Let $L \subseteq \Sigma^\omega$ be an analytic but non-Borel $\omega$-language accepted by a Büchi Turing machine
$(\mathcal{M}, F)$, where $\mathcal{M} = (Q, \Sigma, \Gamma, \delta, q_0)$ is a Turing machine and $F \subseteq Q$. As in the proof of
Proposition 3.3 we consider the set $R$ defined by:

$R = \{(w, r) \mid w \in \Sigma^\omega$ and $r \in \{0, 1\}^\omega$ is an accepting run of $(\mathcal{M}, F)$ on the $\omega$-word $w\}$.

The set $R$ is a $\Delta^1_1$-set, and thus it is a Borel subset of $\Sigma^\omega \times \{0, 1\}^\omega$. But by hypothesis the set
$\mathrm{PROJ}_{\Sigma^\omega}(R) = L$ is not Borel. Thus it follows from Lemma 4.1 that the set of $\omega$-words which
have $2^{\aleph_0}$ accepting runs by $(\mathcal{M}, F)$ has cardinality $2^{\aleph_0}$. □

We now know that every recursive $\omega$-language which is non-Borel has a maximum degree of
ambiguity. On the other hand Proposition 3.3 states that every recursive $\omega$-language which does
not belong to the (effective) class $\Delta^1_1$ is actually inherently ambiguous. In fact we can prove a
stronger result, using the following effective version of a theorem of Lusin and Novikov:

Theorem 4.3 (see 4.F.16 page 195 of [Mos09]) Let $\Sigma$ and $X$ be two finite alphabets having at
least two letters and $B$ be a $\Delta^1_1$-subset of $\Sigma^\omega \times X^\omega$ such that for all $\alpha \in \Sigma^\omega$ the section $B_\alpha =
\{\beta \in X^\omega \mid (\alpha, \beta) \in B\}$ is countable. Then the set $\mathrm{PROJ}_{\Sigma^\omega}(B)$ is also a $\Delta^1_1$-subset of $\Sigma^\omega$.

We can now state the following result. 

Theorem 4.4 Let $L \subseteq \Sigma^\omega$ be an $\omega$-language accepted by a Büchi Turing machine $(\mathcal{M}, F)$ such
that $L$ is not a $\Delta^1_1$-set. Then there exist infinitely many $\omega$-words which have $2^{\aleph_0}$ accepting runs by
$(\mathcal{M}, F)$.

Proof. Let $L \subseteq \Sigma^\omega$ be an $\omega$-language which is not a $\Delta^1_1$-set and which is accepted by a Büchi
Turing machine $(\mathcal{M}, F)$, where $\mathcal{M} = (Q, \Sigma, \Gamma, \delta, q_0)$ is a Turing machine and $F \subseteq Q$. As in the
proof of Proposition 3.3 we consider the set $R$ defined by:

$R = \{(w, r) \mid w \in \Sigma^\omega$ and $r \in \{0, 1\}^\omega$ is an accepting run of $(\mathcal{M}, F)$ on the $\omega$-word $w\}$.

The set of accepting runs of $(\mathcal{M}, F)$ on an $\omega$-word $w \in \Sigma^\omega$ is the section

$R_w = \{r \in \{0, 1\}^\omega \mid (w, r) \in R\}$.

We have seen that the set $R$ is a $\Delta^1_1$-set hence also a $\Sigma^1_1$-set, and thus for each $\omega$-word $w \in \Sigma^\omega$ the
set $R_w$ is in the relativized class $\Sigma^1_1(w)$. On the other hand it is known that a $\Sigma^1_1(w)$-set is countable
if and only if all of its members are in the class $\Delta^1_1(w)$, see [Mos09, page 184]. Therefore the set
$R_w$ is countable iff for all $r \in R_w$, $r \in \Delta^1_1(w)$. Notice also that $R_w$ is an analytic set, thus it is
either countable or has the cardinality $2^{\aleph_0}$ of the continuum.

Recall that Harrington, Kechris and Louveau obtained a coding of $\Delta^1_1$-subsets of $\{0, 1\}^\omega$ in
[HKL90]. Notice that in the same way they obtained also a coding of the $\Delta^1_1(w)$-subsets of $\{0, 1\}^\omega$
which we now recall.

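For each $w \in \Sigma^\omega$ there exists a $\Pi^1_1(w)$-set $W(w) \subseteq \mathbb{N}$ and a $\Pi^1_1(w)$-set $C(w) \subseteq \mathbb{N} \times \{0, 1\}^\omega$
such that, if we denote $C_n(w) = \{x \in \{0, 1\}^\omega \mid (n, x) \in C(w)\}$, then $\{(n, \alpha) \in \mathbb{N} \times \{0, 1\}^\omega \mid
n \in W(w)$ and $\alpha \notin C_n(w)\}$ is a $\Pi^1_1(w)$-subset of the product space $\mathbb{N} \times \{0, 1\}^\omega$ and the
$\Delta^1_1(w)$-subsets of $\{0, 1\}^\omega$ are the sets of the form $C_n(w)$ for $n \in W(w)$.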

We can now express $[(\exists n \in W(w)) \ C_n(w) = \{x\}]$ by the sentence $\phi(x, w)$:

$\exists n [n \in W(w)$ and $(n, x) \in C(w)$ and $\forall y \in \{0, 1\}^\omega [(n \in W(w)$ and $(n, y) \notin C(w))$ or $(y = x)]]$

But we know that $C(w)$ is a $\Pi^1_1(w)$-set and that $\{(n, \alpha) \in \mathbb{N} \times \{0, 1\}^\omega \mid n \in W(w)$ and
$\alpha \notin C_n(w)\}$ is a $\Pi^1_1(w)$-subset of $\mathbb{N} \times \{0, 1\}^\omega$. Moreover the quantification $\exists n$ in the above formula
is a first-order quantification, therefore the above formula $\phi(x, w)$ is a $\Pi^1_1$-formula. We can now
express that $R_w$ is countable by the sentence $\psi(w)$:

Vx G {0, 1} W [(x i R w ) or (3n G W(w) C n (w) = {x})] 

that is, 

VxG{0,l} w [(x^^ar^u;)] 
This is a n}-formula thus i?^, is uncountable is expressed by a S}-formula and thus the set 

D = {w | w G S w and there are uncountably many accepting runs of (At, F) on u;}. 

is a Ej-set. 
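
The complexity bound on $D$ can be laid out step by step. The following is only a summary sketch of the argument above; it relies on the fact, recalled earlier in the proof, that $R$ is a $\Delta_1^1$-set, and on the standard closure of the class $\Pi_1^1$ under disjunction and under universal quantification over $\{0,1\}^\omega$.

```latex
\begin{align*}
x \notin R_w
  &\quad \text{is } \Delta_1^1 \text{ in } (w, x), \text{ since } R \text{ is a } \Delta_1^1\text{-set},\\
\phi(x, w)
  &\quad \text{is } \Pi_1^1,\\
\psi(w) \;\equiv\; \forall x \in \{0,1\}^\omega\, [\, x \notin R_w \ \text{or}\ \phi(x, w)\,]
  &\quad \text{is } \Pi_1^1,\\
w \in D \iff R_w \text{ is uncountable} \iff \neg\psi(w)
  &\quad \text{hence } D \text{ is } \Sigma_1^1 .
\end{align*}
```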

Towards a contradiction, assume now that the set $D$ is finite. Then for every $x \in D$ the
singleton $\{x\}$ is a $\Delta_1^1$-subset of $\Sigma^\omega$ because $D$ is a countable $\Sigma_1^1$-set. But $D$ is finite, so it
would be the union of a finite set of $\Delta_1^1$-sets and thus it would also be a $\Delta_1^1$-set. Consider now
the set $R' = R \setminus (D \times \{0,1\}^\omega)$. This set would also be a $\Delta_1^1$-set and $\mathrm{PROJ}_{\Sigma^\omega}(R') = L \setminus D$ would
not be in the class $\Delta_1^1$ because by hypothesis $L$ is not a $\Delta_1^1$-set. But then we could infer from
Theorem 4.3 that there would exist an $\omega$-word $w \in L \setminus D$ having uncountably many accepting
runs by the Büchi Turing machine $(\mathcal{M}, F)$. This is impossible by definition of $D$ and thus we can
conclude that $D$ is infinite, i.e. that there exist infinitely many $\omega$-words which have uncountably
many, or equivalently $2^{\aleph_0}$, accepting runs by $(\mathcal{M}, F)$. □

Remark 4.5 We cannot obtain a stronger result like "there exist $2^{\aleph_0}$ $\omega$-words which have $2^{\aleph_0}$
accepting runs by $(\mathcal{M}, F)$" in the conclusion of the above Theorem 4.4 because there are some
countable subsets of $\Sigma^\omega$ which are in the class $\Sigma_1^1 \setminus \Delta_1^1$.

Remark 4.6 The result given by Theorem 4.4 is a dichotomy result for recursive $\omega$-languages. A
recursive $\omega$-language $L$ is either unambiguous or has a great degree of ambiguity: for every Büchi
Turing machine $(\mathcal{M}, F)$ accepting it there exist infinitely many $\omega$-words which have $2^{\aleph_0}$ accepting
runs by $(\mathcal{M}, F)$. This could be compared to the case of context-free $\omega$-languages accepted by
Büchi pushdown automata: it is proved in [Fin03] that there exist some context-free $\omega$-languages
which are inherently ambiguous of every finite degree $n \geq 2$ (and also some others of infinite
degree).

There are many examples of recursive $\omega$-languages which are Borel and inherently ambiguous
of great degree since there are some sets which are $(\Sigma_1^1 \setminus \Delta_1^1)$-sets in every Borel class $\Sigma_\alpha^0$ or $\Pi_\alpha^0$.
On the other hand recall that Kechris, Marker and Sami proved in [KMS89] that the supremum
of the set of Borel ranks of (effective) $\Sigma_1^1$-sets is the ordinal $\gamma_2^1$. This ordinal is precisely defined
in [KMS89] where it is proved to be strictly greater than the ordinal $\delta_2^1$ which is the first non-$\Delta_2^1$
ordinal. In particular it holds that $\omega_1^{CK} < \gamma_2^1$, where $\omega_1^{CK}$ is the first non-recursive ordinal. On the
other hand it is known that the ordinals $\gamma < \omega_1^{CK}$ are the Borel ranks of (effective) $\Delta_1^1$-sets. Thus
we can state the following result.

Proposition 4.7 If $\Sigma$ is a finite alphabet and $L \subseteq \Sigma^\omega$ is a recursive $\omega$-language which is Borel
of rank $\alpha$ greater than or equal to the ordinal $\omega_1^{CK}$ then for every Büchi Turing machine $(\mathcal{M}, F)$
accepting it there exist infinitely many $\omega$-words which have $2^{\aleph_0}$ accepting runs by $(\mathcal{M}, F)$.

Notice that this can be applied in a similar way to context-free $\omega$-languages accepted by Büchi
pushdown automata and to infinitary rational relations accepted by Büchi 2-tape automata, where
ambiguity refers here to acceptance by these less powerful accepting devices, see [Fin03, FS03].
If $L \subseteq \Sigma^\omega$ is a context-free $\omega$-language (respectively, $L \subseteq \Sigma^\omega \times \Gamma^\omega$ is an infinitary rational
relation) which is Borel of rank $\alpha$ greater than or equal to the ordinal $\omega_1^{CK}$ then $L$ is an inherently
ambiguous context-free $\omega$-language (respectively, infinitary rational relation) of degree $2^{\aleph_0}$ as
defined in [Fin03, FS03].

We have established in Theorem 4.2 that if $L \subseteq \Sigma^\omega$ is an $\omega$-language accepted by a Büchi
Turing machine $(\mathcal{M}, F)$ such that $L$ is an analytic but non-Borel set, then the set of $\omega$-words
which have $2^{\aleph_0}$ accepting runs by $(\mathcal{M}, F)$ has cardinality $2^{\aleph_0}$. It is then very natural to ask
whether this very strong ambiguity property is characteristic of non-Borel recursive $\omega$-languages
or if some Borel recursive $\omega$-languages could also have this strongest degree of ambiguity. We are
going to see that it is consistent with ZFC that some $\omega$-languages in the Borel class $\Pi_2^0$, hence of a
low Borel rank, can have this maximum degree of ambiguity. We first formally define this notion.

Definition 4.8 Let $\Sigma$ be a finite alphabet and $L \subseteq \Sigma^\omega$ be a recursive $\omega$-language. Then the
$\omega$-language $L$ is said to have the maximum degree of ambiguity if, for every Büchi Turing machine
$(\mathcal{M}, F)$ accepting $L$, the set of $\omega$-words which have $2^{\aleph_0}$ accepting runs by $(\mathcal{M}, F)$ has
cardinality $2^{\aleph_0}$. The set of recursive $\omega$-languages having the maximum degree of ambiguity is denoted
Max-Amb.

We first state some high undecidability properties. Recall that a Büchi Turing
machine has a finite description and thus one can associate in a recursive and injective manner a
positive integer $z$ to each Büchi Turing machine $\mathcal{T}$. The integer $z$ is then called the index of the
machine $\mathcal{T}$ and we have a Gödel numbering of the Büchi Turing machines. In the sequel the Büchi
Turing machine of index $z$, reading words over the alphabet $\Gamma = \{a, b\}$, will be denoted $\mathcal{T}_z$.

We recall the notions of 1-reduction and of $\Sigma_n^1$-completeness (respectively, $\Pi_n^1$-completeness)
for subsets of $\mathbb{N}$ (or of $\mathbb{N}^l$ for some integer $l \geq 2$). Given two sets $A, B \subseteq \mathbb{N}$ we say $A$ is
1-reducible to $B$ and write $A \leq_1 B$ if there exists a total computable injective function $f$ from $\mathbb{N}$
to $\mathbb{N}$ with $A = f^{-1}[B]$. A set $A \subseteq \mathbb{N}$ is said to be $\Sigma_n^1$-complete (respectively, $\Pi_n^1$-complete) iff
$A$ is a $\Sigma_n^1$-set (respectively, $\Pi_n^1$-set) and for each $\Sigma_n^1$-set (respectively, $\Pi_n^1$-set) $B \subseteq \mathbb{N}$ it holds
that $B \leq_1 A$. It is known that, for each integer $n \geq 1$, there exist some $\Sigma_n^1$-complete and some
$\Pi_n^1$-complete subsets of $\mathbb{N}$; some examples of such sets are described in [Rog67, CC89].
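
As a toy illustration of the definition of 1-reducibility (far below the analytical classes considered here), the following Python sketch checks the condition $A = f^{-1}[B]$ on an initial segment of $\mathbb{N}$. The sets `A` (even numbers) and `B` (multiples of 4) and the map `f(n) = 2n` are assumptions chosen for this example and do not come from the paper; a finite check of this kind only illustrates the definition and proves nothing about the infinite sets.

```python
def in_A(n: int) -> bool:
    """Hypothetical set A: the even natural numbers."""
    return n % 2 == 0


def in_B(n: int) -> bool:
    """Hypothetical set B: the multiples of 4."""
    return n % 4 == 0


def f(n: int) -> int:
    """Candidate 1-reduction from A to B: total, computable and injective."""
    return 2 * n


def check_one_reduction(bound: int) -> bool:
    """Check, for all n below bound, that n is in A iff f(n) is in B,
    and that f takes pairwise distinct values on that range."""
    preserves = all(in_A(n) == in_B(f(n)) for n in range(bound))
    injective = len({f(n) for n in range(bound)}) == bound
    return preserves and injective


print(check_one_reduction(1000))
```

The same function `f` also witnesses that the complement of `A` is 1-reducible to the complement of `B`, which is the observation used when a reduction is transferred from a complete set to its complement.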



Theorem 4.9 The unambiguity problem for $\omega$-languages of Büchi Turing machines is $\Pi_2^1$-complete,
i.e.: The set $\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \text{ is non ambiguous}\}$ is $\Pi_2^1$-complete.

Proof. We can first express "$\mathcal{T}_z$ is non ambiguous" by:

$\forall x \in \Gamma^\omega\ \forall r, r' \in \{0,1\}^\omega\ [(r \text{ and } r' \text{ are accepting runs of } \mathcal{T}_z \text{ on } x) \rightarrow r = r']$

which is a $\Pi_1^1$-formula. Then "$L(\mathcal{T}_z)$ is non ambiguous" can be expressed by the following
formula: "$\exists y\, [L(\mathcal{T}_z) = L(\mathcal{T}_y)$ and ($\mathcal{T}_y$ is non ambiguous)$]$". This is a $\Pi_2^1$-formula because "$L(\mathcal{T}_z) = L(\mathcal{T}_y)$"
can be expressed by the $\Pi_2^1$-formula

$\forall x \in \Gamma^\omega\ [(x \in L(\mathcal{T}_z) \text{ and } x \in L(\mathcal{T}_y)) \text{ or } (x \notin L(\mathcal{T}_z) \text{ and } x \notin L(\mathcal{T}_y))]$,

and the quantification $\exists y$ is a first-order quantification bearing on integers. Thus the set
$\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \text{ is non ambiguous}\}$ is a $\Pi_2^1$-set.

To prove completeness we use a construction we already used in [Fin09b]. We first define
the following operation on $\omega$-languages. For $x, x' \in \Gamma^\omega$ the $\omega$-word $x \otimes x'$ is defined by: for
every integer $n \geq 1$, $(x \otimes x')(2n - 1) = x(n)$ and $(x \otimes x')(2n) = x'(n)$. For two $\omega$-languages
$L, L' \subseteq \Gamma^\omega$, the $\omega$-language $L \otimes L'$ is defined by $L \otimes L' = \{x \otimes x' \mid x \in L \text{ and } x' \in L'\}$.
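
The interleaving operation on $\omega$-words can be illustrated on finite prefixes. The Python sketch below is only an illustration under stated assumptions: infinite words are modelled as (possibly infinite) iterators of letters, and the helper names `interleave`, `split` and `prefix` are hypothetical, not taken from the paper. Python strings are 0-indexed, so position $2n-1$ in the definition corresponds to index $2n-2$ in the code.

```python
from itertools import cycle, islice
from typing import Iterable, Iterator, Tuple


def interleave(x: Iterable[str], xp: Iterable[str]) -> Iterator[str]:
    """Yield the letters x(1), x'(1), x(2), x'(2), ... of the interleaved word."""
    for a, b in zip(x, xp):
        yield a
        yield b


def split(y: str) -> Tuple[str, str]:
    """Recover prefixes of x and x' from an even-length prefix of the interleaving."""
    return y[0::2], y[1::2]


def prefix(word: Iterable[str], n: int) -> str:
    """Return the first n letters of a (possibly infinite) word as a string."""
    return "".join(islice(word, n))


# x = (ab)^omega and x' = b^omega, both over the alphabet {a, b}
y = prefix(interleave(cycle("ab"), cycle("b")), 8)
print(y)         # abbbabbb
print(split(y))  # ('abab', 'bbbb')
```

Since `split` inverts `interleave` on prefixes, each of the two words can be read off from the interleaving; this is the property used in the second case of the proof, where fixing the second component $x$ recovers the set $L$ as a section.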

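We know that there is a simple example of a $\Sigma_1^1$-complete set $L \subseteq \Gamma^\omega$ accepted by a Büchi
Turing machine. It is then easy to define an injective computable function $\theta$ from $\mathbb{N}$ into $\mathbb{N}$ such
that, for every integer $z \in \mathbb{N}$, it holds that $L(\mathcal{T}_{\theta(z)}) = (L \otimes \Gamma^\omega) \cup (\Gamma^\omega \otimes L(\mathcal{T}_z))$. There are now
two cases.

First case. $L(\mathcal{T}_z) = \Gamma^\omega$. Then $L(\mathcal{T}_{\theta(z)}) = \Gamma^\omega$ and $L(\mathcal{T}_{\theta(z)})$ is unambiguous.
Second case. $L(\mathcal{T}_z) \neq \Gamma^\omega$. Then there is an $\omega$-word $x \in \Gamma^\omega$ such that $x \notin L(\mathcal{T}_z)$. But $L(\mathcal{T}_{\theta(z)}) = (L \otimes \Gamma^\omega) \cup (\Gamma^\omega \otimes L(\mathcal{T}_z))$,
thus $\{\sigma \in \Gamma^\omega \mid \sigma \otimes x \in L(\mathcal{T}_{\theta(z)})\} = L$ is a $\Sigma_1^1$-complete set.
Thus $L(\mathcal{T}_{\theta(z)})$ is not Borel and this implies, by Theorem 4.2, that $L(\mathcal{T}_{\theta(z)})$ is in Max-Amb and in
particular that $L(\mathcal{T}_{\theta(z)})$ is inherently ambiguous.
We have proved, using the reduction $\theta$, that:

$\{z \in \mathbb{N} \mid L(\mathcal{T}_z) = \Gamma^\omega\} \leq_1 \{z \in \mathbb{N} \mid L(\mathcal{T}_z) \text{ is non ambiguous}\}$

Thus this latter set is $\Pi_2^1$-complete because the universality problem for $\omega$-languages of Turing
machines is itself $\Pi_2^1$-complete, see [CC89, Fin09b]. □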

Theorem 4.10 The set $\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \in \text{Max-Amb}\}$ is $\Sigma_2^1$-complete.

Proof. We first show that the set $\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \in \text{Max-Amb}\}$ is in the class $\Sigma_2^1$. In a similar
way as in the proof of Proposition 3.3 we consider the set $R_z$ defined by:

$R_z = \{(w, r) \mid w \in \Gamma^\omega \text{ and } r \in \{0,1\}^\omega \text{ is an accepting run of } \mathcal{T}_z \text{ on the } \omega\text{-word } w\}.$

This set $R_z$ is a $\Delta_1^1$-subset of $\Gamma^\omega \times \{0,1\}^\omega$. Notice that the set of accepting runs of $\mathcal{T}_z$ on an
$\omega$-word $w \in \Gamma^\omega$ is the section

$R_{z,w} = \{r \in \{0,1\}^\omega \mid (w, r) \in R_z\}.$

It is a set in the relativized class $\Sigma_1^1(w)$ and thus it is uncountable iff it contains a point $r_0$ such
that $\{r_0\}$ is not a $\Delta_1^1(w)$-subset of $\{0,1\}^\omega$. Moreover we have already seen that the set

$D_z = \{w \mid w \in \Gamma^\omega \text{ and there are uncountably many accepting runs of } \mathcal{T}_z \text{ on } w\}$

is a $\Sigma_1^1$-set. Thus it is uncountable iff it contains a member which is not in the class $\Delta_1^1$. Recall now
that Harrington, Kechris and Louveau obtained a coding of $\Delta_1^1$-subsets (respectively, of $\Delta_1^1(w)$-subsets)
of $\{0,1\}^\omega$ in [HKL90] (see the proof of the above Theorem 4.4). Then there is a $\Pi_1^1$-formula
$\Theta_1(w)$ such that for every $w \in \Gamma^\omega$ it holds that $\{w\}$ is in the class $\Delta_1^1$ iff $\Theta_1(w)$ holds.
And there is a $\Pi_1^1$-formula $\Theta_2(w, r)$ such that for every $w \in \Gamma^\omega$ and $r \in \{0,1\}^\omega$ it holds that $\{r\}$
is in the class $\Delta_1^1(w)$ iff $\Theta_2(w, r)$ holds. We can now express the sentence "the set of $\omega$-words
which have $2^{\aleph_0}$ accepting runs by $\mathcal{T}_z$ has cardinality $2^{\aleph_0}$" by the following formula $\Omega(z)$:

$\exists w\, \exists r\, [\neg\Theta_1(w) \wedge \neg\Theta_2(w, r) \wedge (w, r) \in R_z]$

This formula $\Omega(z)$ is clearly a $\Sigma_1^1$-formula. We can now express the sentence "$L(\mathcal{T}_z) \in \text{Max-Amb}$"
by the following sentence:

$\forall z' \in \mathbb{N}\, [L(\mathcal{T}_z) \neq L(\mathcal{T}_{z'}) \text{ or } \Omega(z')]$

This is a $\Sigma_2^1$-formula because "$L(\mathcal{T}_z) \neq L(\mathcal{T}_{z'})$" is easily expressed by a $\Sigma_2^1$-formula (see the proof
of Theorem 4.9), the formula $\Omega(z')$ is a $\Sigma_1^1$-formula, and the first-order quantification $\forall z'$ bears on
integers. Thus we have proved that the set $\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \in \text{Max-Amb}\}$ is in the class $\Sigma_2^1$.

To prove the completeness part of the theorem we can use the same reduction $\theta$ as in the proof
of the preceding theorem and the fact that the universality problem for $\omega$-languages of Turing
machines is $\Pi_2^1$-complete. □
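
The way the single map $\theta$ serves both completeness proofs can be summarized as follows. This is only a sketch of the case analysis in the proof of Theorem 4.9; the abbreviations $\mathrm{Univ}$ and $\mathrm{Unamb}$ for the two index sets are introduced here for readability and are not notation from the paper.

```latex
\begin{align*}
\mathrm{Univ} &= \{z \in \mathbb{N} \mid L(\mathcal{T}_z) = \Gamma^\omega\}, \qquad
\mathrm{Unamb} = \{z \in \mathbb{N} \mid L(\mathcal{T}_z) \text{ is non ambiguous}\},\\[4pt]
z \in \mathrm{Univ}
  &\;\Longrightarrow\; L(\mathcal{T}_{\theta(z)}) = \Gamma^\omega
  \;\Longrightarrow\; \theta(z) \in \mathrm{Unamb}
  \ \text{ and } \ L(\mathcal{T}_{\theta(z)}) \notin \text{Max-Amb},\\
z \notin \mathrm{Univ}
  &\;\Longrightarrow\; L(\mathcal{T}_{\theta(z)}) \text{ is not Borel}
  \;\Longrightarrow\; L(\mathcal{T}_{\theta(z)}) \in \text{Max-Amb}
  \ \text{ and } \ \theta(z) \notin \mathrm{Unamb}.
\end{align*}
```

Hence $\theta$ witnesses both $\mathrm{Univ} \leq_1 \mathrm{Unamb}$ and $\mathbb{N} \setminus \mathrm{Univ} \leq_1 \{z \in \mathbb{N} \mid L(\mathcal{T}_z) \in \text{Max-Amb}\}$. Since $\mathrm{Univ}$ is $\Pi_2^1$-complete, its complement is $\Sigma_2^1$-complete, which gives the lower bound for Max-Amb. The first line uses the observation that an unambiguous language is never in Max-Amb, because it is accepted by some machine in which every word has at most one accepting run.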

We now briefly recall some notions of set theory which will be useful for the next result and
refer the reader to a textbook like [Jec02] for more background on set theory.

The usual axiomatic system ZFC is the Zermelo-Fraenkel system ZF plus the axiom of choice
AC. The axioms of ZFC express some natural facts that we consider to hold in the universe of
sets. A model $(V, \in)$ of an arbitrary set of axioms $\mathbb{A}$ is a collection $V$ of sets, equipped with the
membership relation $\in$, where "$x \in y$" means that the set $x$ is an element of the set $y$, which
satisfies the axioms of $\mathbb{A}$. We often say "the model $V$" instead of "the model $(V, \in)$".

We say that two sets $A$ and $B$ have the same cardinality iff there is a bijection from $A$ onto $B$ and
we denote this by $A \approx B$. The relation $\approx$ is an equivalence relation. Using the axiom of choice
AC, one can prove that any set $A$ can be well-ordered, so there is an ordinal $\gamma$ such that $A \approx \gamma$.
In set theory the cardinal of the set $A$ is then formally defined as the smallest such ordinal $\gamma$. The
infinite cardinals are usually denoted by $\aleph_0, \aleph_1, \aleph_2, \ldots, \aleph_\alpha, \ldots$ The continuum hypothesis CH
says that the first uncountable cardinal $\aleph_1$ is equal to $2^{\aleph_0}$, which is the cardinal of the continuum.

If $V$ is a model of ZF and $\mathbf{L}$ is the class of constructible sets of $V$, then the class $\mathbf{L}$ is a model
of ZFC + CH. Notice that the axiom V=L, which means "every set is constructible", is consistent
with ZFC because $\mathbf{L}$ is a model of ZFC + V=L.

Consider now a model $V$ of ZFC and the class of its constructible sets $\mathbf{L} \subseteq V$, which is another
model of ZFC. It is known that the ordinals of $\mathbf{L}$ are also the ordinals of $V$, but the cardinals in
$V$ may be different from the cardinals in $\mathbf{L}$. In particular, the first uncountable cardinal in $\mathbf{L}$ is
denoted $\aleph_1^{\mathbf{L}}$ and it is in fact an ordinal of $V$ which is denoted $\omega_1^{\mathbf{L}}$. It is well-known that in general
this ordinal satisfies the inequality $\omega_1^{\mathbf{L}} \leq \omega_1$. In a model $V$ of the axiomatic system ZFC + V=L
the equality $\omega_1^{\mathbf{L}} = \omega_1$ holds, but in some other models of ZFC the inequality may be strict and
then $\omega_1^{\mathbf{L}} < \omega_1$.

The following result was proved in [Fin09a]. 

Theorem 4.11 There exists a real-time 1-counter Büchi automaton $\mathcal{A}$, which can be effectively
constructed, such that the topological complexity of the $\omega$-language $L(\mathcal{A})$ is not determined by
the axiomatic system ZFC. Indeed it holds that:

(1) (ZFC + V=L). The $\omega$-language $L(\mathcal{A})$ is an analytic but non-Borel set.

(2) (ZFC + $\omega_1^{\mathbf{L}} < \omega_1$). The $\omega$-language $L(\mathcal{A})$ is a $\Pi_2^0$-set.

We can now state the announced result. 

Theorem 4.12 (ZFC + $\omega_1^{\mathbf{L}} < \omega_1$). There exists an $\omega$-language accepted by a real-time 1-counter
Büchi automaton which belongs to the Borel class $\Pi_2^0$ and which has the maximum degree of
ambiguity with regard to acceptance by Turing machines, i.e. which belongs to the class Max-Amb.

Proof. Consider the real-time 1-counter Büchi automaton $\mathcal{A}$ given by Theorem 4.11. It may be
seen as a Turing machine which has an index $z_0$ so that $L(\mathcal{A}) = L(\mathcal{T}_{z_0})$. Let now $V$ be a model
of (ZFC + $\omega_1^{\mathbf{L}} < \omega_1$). In this model $L(\mathcal{A})$ is a Borel set in the class $\Pi_2^0$. We are going to show that
it is also in the class Max-Amb.

Consider the model $\mathbf{L}$ which is the class of constructible sets of $V$. The class $\mathbf{L}$ is a model of
(ZFC + V=L) and thus by Theorem 4.11 the $\omega$-language $L(\mathcal{A})$ is an analytic but non-Borel set in
$\mathbf{L}$. Then it follows from Theorem 4.2 that in $\mathbf{L}$ the $\omega$-language $L(\mathcal{T}_{z_0})$ is in the class Max-Amb.
On the other hand, the set $\{z \in \mathbb{N} \mid L(\mathcal{T}_z) \in \text{Max-Amb}\}$ is a $\Sigma_2^1$-set by Theorem 4.10. Thus by
Shoenfield's Absoluteness Theorem (see [Jec02, page 490]) this set is the same in the model $V$
and in the model $\mathbf{L}$. This implies that the $\omega$-language $L(\mathcal{A}) = L(\mathcal{T}_{z_0})$ has the maximum degree
of ambiguity with regard to acceptance by Turing machines in the model $V$ too. □
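
The transfer between the two models can be displayed as a chain of implications. This is a sketch of the argument just given; the notation $\mathbf{L} \models \varphi$ and $V \models \varphi$ for truth of a statement in the respective model is assumed here for compactness and is not used elsewhere in the text.

```latex
\begin{align*}
\mathbf{L} \models \mathrm{ZFC} + \mathrm{V{=}L}
  &\;\Longrightarrow\; \mathbf{L} \models \text{``} L(\mathcal{T}_{z_0}) \text{ is analytic but not Borel''}
  && \text{(Theorem 4.11 (1))}\\
  &\;\Longrightarrow\; \mathbf{L} \models \text{``} L(\mathcal{T}_{z_0}) \in \text{Max-Amb''}
  && \text{(Theorem 4.2)}\\
  &\;\Longrightarrow\; V \models \text{``} L(\mathcal{T}_{z_0}) \in \text{Max-Amb''}
  && \text{(a } \Sigma_2^1 \text{ statement, Shoenfield absoluteness)}
\end{align*}
```

The Borel class of $L(\mathcal{A})$, by contrast, is evaluated directly in $V$, where Theorem 4.11 (2) places it in $\Pi_2^0$.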

Remark 4.13 In order to prove Theorem 4.12 we do not need to use any large cardinal axiom or
even the consistency of such an axiom, because it is known that (ZFC + $\omega_1^{\mathbf{L}} < \omega_1$) is equiconsistent
with ZFC. However it is known that the existence of a measurable cardinal (or even of a larger
one), or the axiom of analytic determinacy, implies the strict inequality $\omega_1^{\mathbf{L}} < \omega_1$ and thus the
existence of the $\omega$-language in the class Max-Amb given by Theorem 4.12.

5 Concluding remarks 

We have investigated the notion of ambiguity for recursive $\omega$-languages. In particular Theorem
4.4 gives a remarkable dichotomy result for recursive $\omega$-languages: a recursive $\omega$-language $L$ is
either unambiguous or has a great degree of ambiguity.

On the other hand, Theorem 4.12 states that it is consistent with ZFC that there exists a
recursive $\omega$-language which belongs to the Borel class $\Pi_2^0$ and which has the maximum degree
of ambiguity. The following question now naturally arises: "Does there exist such a recursive
$\omega$-language in every model of ZFC?"